Creating Classification model

From a workbook, click Actions > ML Dashboard. The ML Dashboard page is displayed. From the Create Model section, click Classification and specify the following:

Feature	Random Forest
Model Type	Ensemble tree-based model
Best Suited For	Complex, non-linear relationships; mixed data types
Relationship Modeling	Captures non-linear patterns via aggregated decision trees
Handling of Interactions	Automatically captures feature interactions
Interpretability	Moderate – feature importances available, but overall model is a “black box”
Parameter Tuning	Several hyperparameters (n_estimators, max_depth, min_samples_split, etc.)
Data Requirements	Works with high-dimensional, noisy data; tolerates outliers and missing values
Handling Imbalanced Data	Supports class weights or balanced subsampling
Feature Importance	Built-in via mean decrease impurity or permutation importance
Library Support	scikit-learn (RandomForestClassifier), XGBoost (RF variant), Spark MLlib
Prediction Output	Hard labels and class probabilities
Deployment Readiness	Scales well; fast inference per tree; model size grows with number of trees
External Features	Easy to add new features without changing core algorithm
Community & Resources	Extensive tutorials, academic papers, and widespread industry adoption

Click Create and commence the model training process.

Was this helpful?